[Previous] [Next] [Index] [Thread]

Re: Site Scaning & IP graps



 >   Good spiders will ask for /robots.txt and find out what to do with 
 > themselves if they find it.
 > 
 >   Generally grepping for /robots.txt will give you a list of spiders that 
 > have found you.

The access log typically doesn't have the agent name, just the host name
that called for the file.  Some are obvious (fourteen.srv.lycos.com seems
to be visiting me today), others not (204.162.98.47 ??).

While it is possible to have all of the information in one log file with
the NCSA server, I'd bet most folks aren't using the feature.  I don't
know about other servers.

Dennis Boone
MSU CWIS Team


References: